Sep 24 07 1 1 : 57p Humphrea 



15137720052 



p. 8 



RECEIVED 
CENTRAL FAX CEWTBR 

SEP 2 4 2007 



I'ATENT 

ATTY. DOCKET NO. IBM/265 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

BEFORE THE BOARD OF PATENT APPEALS 
AND INTERFERENCES 

Ex parte Abdo Esmail Abdo ct al. 

Appeal No. 



Serial No.: 
Filed: 

Group Art Unit: 
Examiner: 
Applicant: 
Title: 



Cincinnati, Ohio 45202 



10/758,486 
January 15, 2004 
2162 

Giovanna B. Colan 
Abdo Esmail Abdo et al. 

GENERATING STATISTICS ON TEXT PATTERN 
MATCHING PREDICATES FOR ACCESS PLANNING 

September 24, 2007 
Via Facsimile 



APPEAL BRIEF 

This brief is in furtherance of Applicant's Notice of Appeal filed July 23, 2007 , 
appealing the decision of the Examiner dated February 23, 2007 finally rejecting claims 1-23. 
A copy of the claims appears in the Appendix to this brief. 



09/26/2007 WASFAU1 00000079 10758446 
01 FC:1402 500.00 DA 



Certificate or Facsimile or Mail Transmission 

I hereby certify that this correspondence and any enclosures are being 
deposited with the United States Postal Service via U.S. Mail, Prepaid First 
Class Postage or transmitted via facsimile to the number as provided by the 
United States Patent and Trademark Office on the dale indicated below. 



/Thomas W. Humphrey/ 



September 24. 2007 



Thomas W. Humphrey 
Reg. No. 34.353 



Date 



PACE 8/28 * RCVD AT 9/24/2007 11:50:46 PM [Eastern Daylight Time] ■ SVR:USPTO-EFXRP-2/1 0 " DNIS:2738300 " CSID:1 5137720052* DURATION (mm-ss):08-00 



Sep 24 07 ll:57p Humphrey 



15137720052 



Real Party Tn Interest 

The real party in interest in this appeal is INTERNATIONAL BUSINESS 
MACHINES CORPORATION, a corporation of New York having a place of business 
Orchard Road, Armonk,New York 10504. 
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Related Appeals and Interferences 

There are no such appeals or interferences. 



-3- 



PAGE 10/28 - RCVD AT 9/24/2007 11:50:46 PM [Eastern Daylight Time] * SVR:USPTO-EFXRF-2/19 * DNIS:2738300 * CSID:15137720052 - DURATION (mm-88): 08-00 



Sep 24 07 ll:58p Humphrey 



15137720052 



p. 1 1 



Status of Claims 

Total Number of Claims in the Application 

Claims in the application are: 23 

Status of all the Claims 

1. Claims canceled: NONE 

2. Claims withdrawn from consideration but not canceled: NONE 

3. Claims objected to: NONE 

4. Claims allowed or confirmed: NONE 

5. Claims rejected: 1-23 

Claims on Appeal 

The claims are appeal are Claims 1-23. The claims in this application have never been 
amended. 
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Status of Amendments 

An Amendment of Appeal filed with this brief, correcting typographical errors and 

omissions in the specification, is pending. 
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Snmrnan of Claimed Subi r ^ Matter as to Independent Claim } 

Independent Claim 1 is described in the specification on page 9, line 194 to page 12, 
line 1 1, and Figs. 4-5, reference numbers 200-350 of the drawings. 

These pages disclose examples of the invention recited in claim 1, which describes a 
method that utilizes "a data storage structure storing character statistics on an attribute, 
comprising a first structure storing, for each of a plurality of character positions, frequently 
occurring characters in that character position" This structure is used in conjunction with a 
"pattern matching predicate" by the use of "character positions of characters in [the] pattern 

matching predicate." 

Thus, the invention is directed to the storage of statistical information regarding 
characters in an attribute of a database relation, and the use of those statistics in evaluating a 
pattern matching predicate. 
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Summary of Claimed Subject Matter as to Independent Claim 11 

Independent Claim 1 1 is directed to a computer system, such as that described in the 
specification on page 9, line 194 to page 12, line I L and Figs. 4-5, reference numbers 200- 
350 of the drawings. 

These pages disclose examples of the invention recited in claim 1 1 , which describes a 
computer system stores, for a relational database "a data storage structure storing character 
statistics on an attribute, comprising a first structure storing, for each of a plurality of 
character positions, frequently occurring characters in that character position". This structure 
is used in conjunction with a "pattern matching predicate" by the use of "character positions 
of characters in [the] pattern matching predicate." 

Hius, the invention of claim 1 1 is also directed to the storage of statistical information 
regarding characters in an attribute of a database relation, and the use of those statistics in 
evaluating a pattern matching predicate. 
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Summary of Claimed Su b ject Matte r as to Independent Claim 21 

Independent Claim 21 is directed to a program product, implementing a relational 
database system such as described in the specification on page 9, line 194 to page 1 2, line 1 1 , 
and Figs. 4-5, reference numbers 200-350 of the drawings. 

These pages disclose examples of the operation of a program product such as recited in 
claim 21 , in which there is "a data storage structure storing character statistics on an attribute, 
comprising a first structure storing, for each of a plurality of character positions, frequently 
occurring characters in that character position". Relational database software uses the 
structure in conjunction with a "pattern matching predicate" by the use of "character positions 
of characters in [the] pattern matching predicate." 

Thus, again, claim 21 is directed to the storage of statistical information regarding 
characters in an attribute of a database relation, and the use of those statistics in evaluating a 
pattern matching predicate. 
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Grounds of Rejection 

Whether the subject matter of any of claims 1-23 is obvious in light of Fritchman U.S. 

Patent 6,785,677 (Fritchman) in view of Haas et al. 
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Argument 

The Examiner's rejection is premised upon tine assertion that the Fritchman '677 
patent discloses storing character structures that meet the claim language. The claim language 
recites a structure in which, 'Tor each of a plurality of character positions" the structure 
identifies "frequently occurring characters in that character position, and statistics for each 
frequently occurring character". Fritchman does not disclose this. Fritchman, in the cited 
passages, only discloses how to describe a character string criterion for a query, indicating that 
strings should be described using a prefix segment, a suffix segment, and one or more interior 
segments. 

The Examiner has cited to passages in col. 8 of Fritchman regarding a "bit vector 
noting the position of any single-character wild cards" (that can be included in a LIKE 
predicate of a query). The Examiner seems to believe that the representation of a LIKE 
predicate using wildcard characters, is somehow the same as a statistical representation of 
frequently occurring characters in particular character locations in an attribute. They are 
clearly not the same. A wildcard is a way to identify a pattern of characters that match a 
criterion, not a way of identifying, for an attribute of a relation, frequently occurring 
characters in a character position. Indeed, Fritchman does not disclose the generation of any 
such statistical information, or the use of the same, but rather discloses a matching 
methodology in which a "pattern is applied to each database value in ton, where the value is 
the result of evaluating the first operand of the LIKE predicate for each candidate row 
retrieved from the database " Clearly, the application of a pattern to every database value to 
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determine exactly how many matches exist is entirely different from using characterizing data 

to estimate a number of matches. 

The Examiner asserts the Haas paper discloses storing character statistics on an 
attribute, and statistics for each frequently occurring character. However, the Examiner is 
incorrect here as well. The Examiner asserts that page 528 of Haas discloses "storing 
character statistics on an attribute" and "statistics for each frequently occurring character". 
Applicant is unable to identify any such disclosure. Specifically, the text quoted in Haas is 
directed to sampling-based selectivity estimation, using what are called 'augmented frequent 
value' (AFV) statistics. While selectivity estimation and the use of frequent value statistics is 
somewhat relevant to the present invention, Applicant has been unable to identify any 
disclosure in the cited paragraphs of the creation of an index or statistic for text matching, that 
stores "for each of a plurality of character positions, frequently occurring characters in that 
character position", as claimed. The quoted paragraphs on page 528 only generally discuss 
the creation of AFV statistics and do not detail a particular form of such statistics for text 
matching, much less the particular form that is recited in the present claims, namely, "for each 
of a plurality of character posiuons, frequently occurring characters in that character position." 

As the Examiner has not identified how either cited reference relied upon in the 
rejection discloses a index or statistic meeting the claim recitations, namely, "for each of a 
plurality of character positions, frequently occurring characters in that character position," 
Applicant submits that all claims presented herein are allowable and requests the issuance of a 
Notice of Allowability. 
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The Examiner attention is directed to an Amendment for Appeal attached hereto which 
corrects typographical errors noted by Applicant during the preparation of this appeal brief. 
Applicant submits that these amendments do not introduce new matter or new issues and 



In accordance with the foregoing. Applicant submits that the Examiner's 



rejection is in error and a reversal of the rejection and allowance of the claims is therefore 



2700 Carew Tower 
441 Vine Street 
Cincinnati, OH 45202-2917 

Voice: (513) 241-2324 
Facsimile: (513) 241-6234 



respectfully requests their entry. 



requested. 



Respectfully submitted, 
Wood, Herron & Evans, L.L.P. 




Thomas W. Humphrey 
Reg. No. 34,353 
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Claim Appendix 

1 . (original) A method for generating a statistic for a pattern matching 
predicate on an attribute of a relation, to be used in optimizing execution of a 
query directed to one or more attributes of said relation, compri sing 

providing a data storage structure storing character statistics on said 
attribute, comprising a first structure storing, for each of a plurality of character 
positions, frequently occurring characters in that character position, and statistics 
for each frequently occurring character, 

retrieving said statistics in response to said pattern matching predicate 
based upon the character positions of characters in said pattern matching 
predicate, and 

generating said statistic based upon said retrieved character statistics. 

2. (original) The method of claim 1 wherein said data storage structure further 
stores a count of a number of occurrences of a frequently occurring character. 

3. (original) The method of claim 1 wherein said data storage structure further 
comprises a second structure storing frequently occurring characters that are 
subsequent to the frequently occurring characters stored in said first structure. 

1 I 41 (dlfiginaL) . . .The. mdthodof claint S^erein, said jsecogd .dataNtorage.struciure F 

further stores a probability of occurrence of a frequently occurring subsequent 
character. 
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5 . (original) The method of claim 1 wherein generating said statistic based 
upon said retrieved character statistics comprises estimating a statistic for a 
desired character and desired character position, in the event said statistic is not 
stored in said first structure. 

6. (original) The method of claim 5 wherein said estimating comprises 
accumulating statistics for said desired character position, for characters other 
than said desired character, and calculating average frequency of occurrence of 
characters that do not have statistics stored in said first structure. 

7. (original) The method of claim 1 wherein said first structure is a table. 

8 . (original) The method of claim 1 wherein said first structure comprises a 
linked list. 

9. (original) The method of claim 3 wherein said second structure comprises a 
table. 

10. (original) The method of claim 3 wherein said second structure comprises a 
linked list. 

1 1 . (original) A computer system implementing a relational database system 
and generating a statistic for a pattern matching predicate on an attribute of a 
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relation of said relational database, to be used in optimizing execution of a query 
directed to one or more attributes of said relation, comprising 

storage for said relational database,. including a relation having a plurality 
of tuples including values for a plurality of attributes, and a data storage 
structure storing character statistics on an attribute, comprising a first structure 
storing, for each of a plurality of character positions, frequently occurring 
characters in that character position, and statistics for each frequently occurring 
character, and 

computing circuitry performing query optimization and query execution 
upon said relational database, said query optimization including generating said 
statistic for an attribute of said relation by retrieving said statistics in response to 
said pattern matching predicate based upon the character positions of characters 
in said pattern matching predicate, and generating said statistic based upon said 
retrieved character statistics. 

12. (original) The computer system of claim 1 1 wherein said data storage 
stntcture further stores a count of a number of occurrences of a frequently 
occurring character. 

13. (original) The computer system of claim 1 1 wherein said data storage 
structure further comprises a second structure storing frequently occurring 
characters that are subsequent to the frequently occurring characters stored in 
said first structure. 
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14. (original) The computer system of claim 1 3 wherein said second data 
storage structure further stores a probability of occurrence of a frequently 
occurring subsequent character. 

15. (original) The computer system of claim 1 1 wherein said computing 
circuitry generates said statistic based upon said retrieved character statistics by 
estimating a statistic for a desired character and desired character position, in the 
event said statistic is not stored in said first structure. 

1 6. (original) The computer system of claim 1 5 wherein said computer circuitry 
performs said estimating by accumulating statistics for said desired character 
position, for characters other than said desired character, and calculating average 
frequency of occurrence of characters that do not have statistics stored in said 
first structure. 

17. (original) The computer system of claim 1 1 wherein said first structure is a 
table. 

18. (original) The computer system of claim 1 1 wherein said first structure 
comprises a linked list. 

19. (original) The computer system of claim 13 wherein said second structure 
comprises a table. 
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20. (original) The computer system of claim 1 3 wherein said second structure 
comprises a linked list. 

21 . (original) A program product for implementing a relational database system 
and generating a statistic for a pattern matching predicate on an attribute of a 
relation of said relational database, to be used in optimizing execution of a query 
directed to one: or more attributes of said relation, comprising 

a relational database, including a relation having a plurality of tuples 
including values for a plurality of attributes, 

a data storage structure storing character statistics on an attribute, 
comprising a first structure storing, for each of a plurality of character positions, 
frequently occurring characters in that character position, and statistics for each 
frequently occurring character, and 

relational database software performing query optimization and query 
execution upon said relational database, said query optimization including 
generating a statistic for an attribute of said relation by retrieving said statistics 
in response to said pattern matching predicate based upon the character positions 
of characters in said pattern matching predicate, and generating said statistic 
based upon said retrieved character statistics, and 

a signal bearing media holding said relational database and relational 
database software. 
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22. (original) The program product of claim 21 wherein the signal bearing 
media comprises transmission media. 

23. (original) The program product of claim 21 wherein the signal bearing 
media comprises recordable media. 
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Evidence Appendix 

None. 
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Related Proceedings Appendix 



None. 
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